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Abstract 

Two seemingly unrelated problems, scheduling a multiclass queueing system and min- 
imizing a submodular function, share a rather deep connection via the polymatroid that 
is characterized by a submodular set function on the one hand and represents the perfor- 
mance polytope of the queueing system on the other hand. We first develop what we call 
a grouping algorithm that solves the queueing scheduling problem under side constraints, 
with a computational effort of 0(n 3 LP(n)) 1 n being the number of job classes, and LP(n) 
being the computational efforts of solving a linear program with no more than n variables 
and n constraints. The algorithm organizes the job classes into groups, and identifies the 
optimal policy to be a priority rule across the groups and a randomized rule within each 
group (to enforce the side constraints). We then apply the grouping algorithm to the sub- 
modular function minimization, mapping the latter to a queueing scheduling problem with 
side constraints. We show the minimizing subset can be identified by applying the grouping 
algorithm n times. Hence, this results in a algorithm that minimizes a submodular function 
with an effort of 0(n A LP{n)). 

1 Introduction 

Linear optimization over a polymatroid — a polytope that possesses matroid properties — is 
known to be solvable through a greedy algorithm; refer to Edmonds [10], Dunstan and Welsh [9], 
and Welsh [32] . It is also known that the performance space of a multiclass queueing system is a 
polymatroid, provided the system satisfies strong conservation laws; refer to Shanthikumar and 
Yao [30] , and also the earlier works of Coffman and Mitrani [6] , and Federgruen and Groenevelt 
[12, 13]. With this connection, the control or dynamic scheduling of multiclass queues can 
be translated into an optimization problem; and under a linear objective, the optimal control 
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is readily identified as a priority rule associated with one of the vertices of the performance 
polytope. 

In many scheduling applications, there are additional, "side constraints" on the perfor- 
mance; for instance, the delay of jobs should not exceed a certain limit, or the completion rate 
(throughput) of jobs must meet a certain requirement. The dynamic scheduling of multiclass 
queues under such side constraints is still polynomially solvable, as argued in Yao and Zhang 
[33], directly applying a result by Grotschel, Lovasz and Shrijver [19]. This argument, however, 
is essentially an "existence proof." It does not explicitly identify an optimal policy, let alone 
one with a simple structure that appeals to implementation. 

In this paper, we develop a combinatorial algorithm that identifies the optimal policy for 
scheduling multiclass queues with side constraints. This is what we call a "grouping algorithm," 
which partitions the n job classes into groups, and the associated optimal policy is a priority 
rule across the groups, while enforcing the side constraints (via randomization, for instance) 
among the classes within each group. The grouping algorithm identifies the optimal policy with 
a computational effort of 0(n 3 x LP(n)), where LP{n) refers to the complexity of solving a 
linear program with number of variables and constraints bounded by n. 

It turns out that this grouping algorithm can also be used to solve the problem of minimizing 
a submodular function — finding the subset that minimizes a set function, V : 2 E i— > 1Z, 
which maps the subsets of a given set E to the real line and satisfies submodularity (refer to 
Definition 1 (iii)). Here the key ingredient is that a certain representation of the submodular 
function minimization can be turned into a queueing scheduling problem with side constraints, 
with the submodular function relating to the polymatroid that is the performance polytope of 
the queueing system. We show that the minimizing subset can be identified by applying the 
grouping algorithm n = \E\ times; each time, it identifies a subset, and only these subsets are 
candidates for optimality. Hence, the overall computational effort is 0{n A LP{n)). 

Submodular function minimization (SFM), widely regarded as "the most important problem 
related to submodular function" ([20]), has attracted much research effort in recent years. Here 
we briefly highlight the relevant literature, as the details are now available in two recent survey 
papers by Fleischer [14] and McCormick [27]. 

Using the ellipsoid approach, Grotschel, Lovasz and Shrijver [19] established that the SFM 
is polynomially solvable. However, the ellipsoid technique is not usually a practical algorithm, 
and it does not generate much combinatorial insight. Cunningham [7] proposed a max-flow type 
of approach based on linear programming (LP), which results in a combinatorial algorithm that 
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solves the SFM in pseudo-polynomial time. (The qualifier "pseudo" refers to the fact that the 
running time is not only a polynomial in n, but also a polynomial in M, an upper bound of 
the set function.) Recently, combinatorial algorithms that solves SFM in strongly polynomial 
time (meaning, the running time is a polynomial in n only, and is independent of M) have been 
independently developed by Iwata, Fleicher and Fujishige [23], and by Shrijver [31]; and further 
improvement has appeared in Iwata [22, 21], and Fleischer and Iwata [15]. The starting point 
of these algorithms is an equivalent problem over a polymatroid with side constraints, and this 
LP is solved via an augmenting-path approach. In contrast, our approach to the SFM is to 
solve a sequence of n LP problems; each is an LP over a polymatroid with side constraints, but 
the inequalities in the side constraints are in the opposite direction to those in existing SFM 
algorithms. None of these n LP's is equivalent to the SFM, but solving all of them does yield 
the solution to the SFM. More importantly, each such LP can be represented as a queueing 
scheduling problem highlighted above, and hence can be solved by the grouping algorithm. 
(Refer to more details in §5, Remarks 1 and 2 in particular.) The advantage of our algorithm is 
that it is fully combinatorial (meaning, it only uses addition, subtraction and comparison); and 
its running time of 0(n 4 LP(n)) matches that of the strongly polynomial version of the ellipsoid 
algorithm, and is significantly better than the fastest existing combinatorial algorithms. (For 
instance, the one due to Iwata [21] has a running time of 0(n 7 ); refer to [27].) 

Briefly, the rest of the paper is organized as follows. In the next section, we review pre- 
liminary materials for linear optimization over a polymatroid, and its connection to ququeing 
scheduling via strong conservation laws. In §3 we collect the essentials of the grouping algorithm 
that solves the queueing scheduling problem with side constraints, in particular the conditions 
that will ensure its feasibility and optimality. This is folllowed by a detailed description of the 
algorithm in §4, where we also show that the required conditions outlined in §3 are all met. In 
§5 we apply the grouping algorithm to the SMF problem. Finally, in §?? we consider a special 
case when the submodular function satisfies the so-called generalized symmetry. 

The polymatroid connection to queueing scheduling has in recent years been extended to 
a more general framework via so-called generalized conservation laws; refer to the papers by 
Bertsimas and his co-workers, e.g., [1, 2, 3]; also refer to Lu [26], Zhang [34], and Chapter 11 
of Chen and Yao [8]. In this paper, however, we choose to stay within the more restrictive 
framework of polymatroid, since our emphasis here is not on queueing scheduling per se, but 
rather on the interplay between such scheduling problems under side constraints and minimizing 
submodular functions. 
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2 Preliminaries 

2.1 LP over a Polymatroid 

Throughout, E = {l,...,n} is a finite set; x = (xi)i e E is a vector, and x(A) := J2ieA x i 
for any A Q E. We shall use "increasing" and "decreasing" in the non-strict sense, to mean 
non-decreasing and non-increasing, respectively. 

Definition 1 (Welsh [32]) Given E = {1,2, ...,n}, the following polytope, 

V(f) = {x>0:x(A)<f(A),ACE}, (1) 

is called a polymatroid if the function / : 2 E — > 1Z + satisfies the following properties: 

(i) (normalized) /(0) = 0; 

(ii) (increasing) if AQB CE, then f(A) < /(B); 

(iii) (submodular) if A, B C B, then /(A) + /(B) > /(A U B) + /(A n B) 

Let 7r be a permutation of {1, 2, n}, and define a vector x 71 " with the following components: 

< = /({*!», 

< 2 = /({TTl.TTa})-/^!}), 

(2) 

^„ = f({ni,Tr2,-,Kn}) -/({7ri,7r 2 ,...,7T n _i}). 

Then, we have another definition for polymatroid: 

Definition 2 The polytope "P(/) in (1) is a polymatroid if x 71 E V(f) for any permutation it. 

That the two definitions, 1 and 2, are equivalent is a classical result; refer to, e.g., Edmonds 
[10], Welsh [32], and Dunstan and Welsh [9]. The following theorem, also a classical result, can 
be found in these references as well. 

Theorem 1 ([9, 10, 32]) Consider the LP problem: max^g-p^) J2ieE °i x i- Suppose tt = (7Ti)i^E 
is a permutation, such that 

Then, x n as specified in (2) is the optimal solution. 



For the polymatroid V(f), we call the following polytope its base: 

B(f) = {x > : x(A) < f(A), A C E; x(E) = f(E)}. (3) 
In queueing applications, it is useful to have another type of base, in addition to the one in 

(3): 

B(b) := {x > : x(A) > 6(A), A C E; x(E) = b(E)}, (4) 

where b(-) is increasing and supermodular, with 6(0) = 0. It is straightforward to verify that 
B(b) = B(f) by letting f(A) = b(E) — b(E — A). Since the resulting / is submodular, increasing, 
and /(0) = 0, we know B(b) is also the base of a polymatroid. In particular, each of its vertices 
can be expressed as x n as in (2), with /(•) replaced by &(■). 

2.2 Polymatroid Structure in Multiclass Queues 

Let E = {1,2, ...,n} denote the set of all job classes (or, types) in a queueing system. Let 
x = (xi)i£E denote the performance vector, e.g., waiting time (delay) or service completions 
(throughput). Let U denote the family of all admissible policies that control the way jobs are 
served in the system. This includes all policies that are non-anticipative (i.e., no clairvoyance) 
and non-idling (i.e., serve whenever there is a job to be served). We have: 

Definition 3 (Shanthikumar and Yao [30]) A performance vector x of a stochastic system 
with a set of multiple job classes, E = {1, ...,n}, is said to satisfy conservation laws, if there 
exists a set function b (resp. /): 2 E i— > 1Z + , satisfying 

&(A) = 5> Wi) V7r:{7ri,...,7T| A |} = A, VACE; (5) 



or respectively, 



f(A) = ^ X7Ti , V7r:{7ri,...,7T| A |} = A VA C E; (6) 



such that under any admissible control, u GW, the following is satisfied: 



or respectively, 



5>«>&(A), WLc£; 5>« =&(£); 

ieA ieE 



^2x1 <f (A), \/A C E; 5>«=/(i5). 

ieA ieE 
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Intuitively, the above definition says: 

• The total performance over all job classes in E is invariant: it is a constant that is policy 
independent. 

• The total performance over any given subset, A C E, of job types is minimized (e.g., if 
x is delay) or maximized (e.g., if x is throughput) by offering priority to job types in A 
over all other types in E — A. 

• The function b(A) or f(A) is the corresponding performance when A is given priority over 
E — A. Furthermore, the permutation it represents a priority rule such that ix\ is given 
the highest priority; 7T2, the second highest priority; and so forth. And the requirement, 
W : {7ri, 7T|A|} = A, signifies that b(A) and f(A) depend only on the composition of 
the job types in A; they are independent of the priorities among these job types. 

The above definition of conservation laws, along with Definition 2, immediately leads to 
the important fact that when conservation laws are satisfied, the performance space over all 
admissible controls is a polymatroid — in fact, the base polytope of a polymatroid, and the right- 
hand-side function is determined by the conservation laws too, i.e. the performance resulting 
from assigning priority to certain subset of jobs. This is simply because any ir, a priority rule, 
is admissible, and hence, x w belongs to the performance polytope, and x w following (5) and (6) 
are of the same form as x w in (2). 

Suppose we want to find the optimal control — a dynamic scheduling rule — of the n job 
types in the set E, so as to maximize an objective function with respect to a performance vector 
x. The problem can be expressed as follows: 

max J2°i x i' ( 7 ) 

with q's being the costs. Suppose x satisfies conservation laws; and hence, the performance 
space under all admissible controls is the base polytope of a polymatroid, denoted B(f). So we 
can first solve a linear programming problem: 

max CjXj. 

From Theorem 1, we know the optimal solution is x^*, where ir* is a permutation that orders 
the cost coefficients in decreasing order: 
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We can then identify a control u* that realizes this performance. The fact that the solution 
being a vertex of the performance space ensures that the optimal policy is a priority rule. It is 
clear that to derive the optimal policy, there is no need to compute the right-hand-side function 
of the performance polytope. For more details regarding conservation laws and the connection 
to polymatroid, including many examples, refer to Chapter 11 of Chen and Yao [8]. 

3 Side Constraints: Feasibility and Optimality 

Consider the polymatriod V(f) in (1). Suppose there are additional constraints: Xj > d; L > 0, 
i G E. (For instance, di is the requirement on the throughput of type i jobs.) Adding these to 
V{f) results in the following polytope: 

V'U) = {x(A) < f(A),A CE;x t > d t ,i G E} (8) 

We want to solve the following LP problem: 

n 

max CkXu. (9) 

For the LP in (9) to be feasible, the following condition is readily verified. 
Lemma 2 (Ross and Yao [28]) The polytope V(f) is non-empty if and only if d := (di, d n ) G 

r(f). 

Below, we shall assume that d G V(f) always holds. A direct application of the following 
result establishes that the LP in (9) is polynomially solvable. 

Theorem 3 (Grotschel, Lovasz, and Schrijver [19]) Let K, be a class of convex bodies. There 
exists a polynomial algorithm to solve the separation problem for members of /C, if and only if 
there exists a polynomial algorithm to solve the optimization problem for the members of fC. 

Making use of this result, we know that the membership test problem (which is equivalent to 
the separation problem) over V(f) is polynomially solvable, since the LP over V(f) is. Hence, 
the membership test over T"{f) is also polynomially solvable, since this amounts to checking 
the additional n side constraints. Therefore, the LP over V(f) is also polynomially solvable. 

However, the argument in [19] is based on an ellipsoid approach. In contrast, below we 
develop a fully combinatorial algorithm that solves the LP over V'(f) in 0(n 4 ) steps, and also 
reveals the structure of the optimal solution. 
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Without loss of generality, assume the coefficients in (9) are ordered as follows: 

Ci > C 2 > • • • > Cn > 0. (10) 

The algorithm below divides the n variables into m < n groups: Gi,...,G m . Denote, 
G(°) = 0; and 

G (i) :=GiUG 2 U---UGi, i = l,...,m. 

Within each group Gj, there is exactly one "lead" variable, indexed li, which takes on a value 
xe { > d£ t ; the rest of the group, denoted Gf (possibly empty), consists of variables that all take 
values at the constraints, i.e., 

Xki=d kl , h£Gf, i = l,...,m. (11) 

Furthermore, the lead variable takes the following value: 

x u = f(G^)-f(G^)-d(Gf), i = l,...,m; (12) 

where we have used the notation d(A) := J2ieA di, for A = Gf . 

Lemma 4 The solution x as specified in (11) and (12) is feasible, i.e., x G V(f), if for each 
i = l, ...,m, we have: (a) x^ > d^, and (b) 

x(A)<f(G^;A)-f(G^), AdG % . (13) 

When the above do hold, we have 

x(G^;A)<f(G^;A), A c Gf, x(G«) = /(G«); (14) 

for all i = l, m. 

Proof. First, all side constraints are satisfied, given (a), along with (11). Also note that from 
(12), since x(Gi) = xe i + d(Gf), we have 

x(G l ) = f(G^)-f(G^), 

which, along with (13), leads to the relations in (14). 

Next, we show x(A) < f(A), for any AC E. Write A = U™ X A, with D t C d, i = 1, ...,m. 
Similar to the G^> notation, denote Z?W := U' l k=1 D k , and := 0. Then, from (13), we have 

x(Di) < /(G^^DA-fiG^) 
< _ /(I> «-i)) 
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where the second inequality follows from submodularity. Hence, summing over all i, taking into 
account that D^s are non-overlapping (since the GVs are), we have 

m 

x(A) = Y / x(D i )<f(D^) = f(A). 
i=i 

□ 

Theorem 5 The feasible solution characterized in Lemma 4 is optimal if the indices of the 
lead and nonlead variables satisfy the following relations: 

lj < tj+i, j = 1, —,m - 1; Zj<kj, kj G Gj, j = 1, ...,m. (15) 

Proof. The dual of the LP in (9) takes the following form: 

mm E VAf{A) ~ E diZi ( 16 ) 
ACE i£E 

s.t. E^M - Zi > a, i £ E; 

A3i 

VA>0, Zi>0; AQE, i £ E; 

where tja and Z\ are the dual variables (the latter corresponds to the side constraints). 
Following (11) and (12), the primal objective value is: 

m m 

E(% - c ej+1 )f(G^) - E E (% ~ ( 17 ) 

where Q m+1 := 0. 

Now, let the dual variables take the following values: 

VA = ctj - ce j+1 , A = G { - 3 \ j = l,...,m; 

let all other = 0; let 

Zkj = c tj - c kj , kj G Gj, j = 1, m; 

and let all other ^ = 0. Then, the non-negativity of these dual variables follows from (15), tak- 
ing into account (10). Furthermore, consider the h-th constraint in the dual problem. Suppose 
h G Gj. If h is a lead variable, i.e., h = £j, then h G C?W for all i > j. Hence the constraint 
becomes: 

E( c ^ - c 4+i) = % = c h- 

i>3 
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If h is a non-lead variable, i.e., h = kj G Gj, then we have 

Zh = z kj = c tj - c kj = c ej - c h , 
as well as h <G for all i > j. In this case, the constraint becomes: 

The complementary slackness condition is also readily checked. (In particular, the dual 
objective is equal to the primal objective in (17).) 

Finally, primal feasibility has already been established in Lemma 4. Hence, the optimality 
of both primal and dual solutions. □ 

4 The Grouping Algorithm 

The grouping algorithm to be presented below forms the groups to maintain feasibility. In 
particular it maintains the distinction of non-lead variables and lead variables in (11) and (12), 
and enforces the two conditions in Lemma 4 — that the lead variable of each group satisfies 
the side constraint, and that (13) holds for each group. Furthermore, we introduce a grouping 
structure 1 within each group to form a hierarchical structure that is crucial for the feasibility 
proof, as well as track the order of variables entering the group. Optimality follows, as argued 
in Lemma 4 and Theorem 5 above. 

Although not essential to understanding the algorithm, it would help intuition to view the 
groups as associated with different priorities, with G\ having the highest priority, and G m the 
lowest priority. The variables can be viewed as throughput associated with each class, whereas 
the side constraints represent the minimal requirements on the throughput. The lead variable in 
each group contributes to maximizing the objective function, a weighted sum of the throughput 
of all job classes. The non-lead variables are non-contributors; as such, they are moved into a 
higher-priority group parsimoniously just to satisfy their respective side constraints. 

Inductively, assume we know how to do the grouping when \E\ = n — 1. Specifically, suppose 
this results in m — 1 groups, G±, G m -\. Within each group Gi, we have, 

• lead variable £f, 

• subgroup g], j = !,■■■ ,jf, 

1 Lack a better choice, we name them subgroups. However, there is no connections between our problem and 
the algebra entities of group and subgroup 
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and they satisfies, Mkj <G g l j, 

f(G^;g[, ■ ■ ■ , sj-i) + d k] < f(G^;g\, • • • , g)_ 1: kj) (18) 

and there exists at least one ke 6 g l j such that, 

f{G^ 1 \g\, • • • , fcj, k e ) - /(G^ 1 ), ffj, ■••,(/}_!, 

<4, (19) 

and, 

/(GW)-/(G( i - 1 ))-d(Gf)>^ i . (20) 

Now consider the added n-th dimension; in particular, we want to find where to put the 
new element n and what value to assign to x n . In the algorithm presented below, this is 
accomplished in Step 1. Then, we need to adjust the newly generated group and the lower 
priority groups that are affected (by the new group) so as to maintain feasibility; and this is 
done in Steps 2 and 3 of the algorithm. 

To start with, we let n itself form the m-th group. 

The Grouping Algorithm 
Step 1. {Group Identification) 

• Find the largest i < m, such that 

f(G^)+d n <f(G^;n), (21) 

and 

f(GW) + d n >f(G®,n). (22) 

Step 2. (Subgroup Identification) Within group G % , find the largest j, such that 

f(G^ 1 \g[, ■■■J j _ 1 )+d n < f(G^-g\, • • • , g^n), (23) 
form new subgroups according to the validity of the following inequality, 

f(G^~ 1 \g[,---,g i j_ 1 ,n) + d kj < f{G^~ V> ;g[,---,g)_ l ,n,kj), (24) 
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• if for all kj € g l j, (24) holds, then, n will becomes new subgroup g l j, and the indices for 
all the previous subgroups that have index larger and equal to j will have to increase by 
one; 

• otherwise n will join the subgroup g l j. 

Step 3. (Subgroup Validation) This step is to determine if there is a feasible solution that 
the variables in subgroup g l j are valued at the side constraints. This is equivalent to solve the 
membership problem for vector (d kl ,dk 2 , • • • ,d ke ) to the polymatroid defined by the following 
submodular function, 

g(A):=f(G^- 1 \g[ r --,g)_ 1 ,A)-f(G^- 1 hg[r--,4-i) ( 25 ) 
i.e. the subgroup is valid if the following LP has feasible solution, 

mmx kl ,s.t.x £ P(g),x ki > d ki (26) 

If the LP in (26) is founded to be infeasible, then this subgroup is invalid, hence to be dissolved. 
Then if j > 1, then we need to go back to Step 2 to identify the subgroup for all the member 
previously in g l p if j = 1, the group G l need to be dissolved, go back to Step 1 for group 
identification for each member previously in G l . Step 4. 

• Conduct validation step for subgroup <7* +1 , <7j +2 ; 

• Repeat the procedure to all the variables that has been bypassed; 

Theorem 6 The grouping algorithm generates a solution that follows (11) and (12), and sat- 
isfies both feasibility requirements (a) and (b) of Lemma 4, and the inequalities in (15) of 
Theorem 5. Hence, it is an optimal solution to the LP in (9). 

Proof. We only need to check the feasibility. To do this, we need to check if our output satisfies 
equation (13). For any icGj, let us consider two cases, (i) ^ G Gj - A; (ii) ti G A. 
Case I: Certainly, we have, A = \Jj =1 Aj, with Aj = Adgj. From the validation step, we know 
that for each j, 

x(A j ) = d(A j ) < f(G^- 1 \gi,---,g}_ 1 ,A j )-f(G^;g\,---,gi_ 1 ) 
Due to submodularity, we have, 

x(A j ) = d(A j ) < f(G^- 1 \A 1 ,---,A j _ 1 ,A j )-f(G^;A 1 ,---,A j _ 1 ) 
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It is true for for each j, then, sum them up, it implies that (13) is true. 
Case II: 

For any subgroup let us denote k{, k 3 2 , ■ ■ ■ , k 3 h of its member according to the order they 
entered the subgroup. Hence, the inequality (22) must be satisfied in the following form, 

d k{ > f(G^ ] g\ r --,g)_ l] l n k{)-f(G^ ] g\,--- 1 g)_ 1 -,e i ), 

fy a > f(G^~^;g{, ■ • • ,gj_i,£i, k{,g\) - f{G^~ V) ;g\, ■ ■ ■ ,gj_i,£i, k{), 

d y > f{G^ i ~ v> \g[, ■ ■ ■ ,gj_i,£i, k{, . . . fcjj-i> kP h ) - f(G^~^;£i,g{, . . . g{-i); 

h 

Because when k\ enters the group, G(i) is made of G^ _1 ), subgroups g\, ■ ■ ■ , <7j_i and its lead 
member £j, so the inequality (22) takes the form of the first inequality, and so on. 
Meanwhile, 

f(G^;g\ r --,gi_ 1 ;e i ,4)-f(G^-,gl---,gi_ 1 -£ i ), 
f(G( i - 1 hg\r--,9U>bAA)-f(G«- 1 hgir--,9U> e i>%), 



f(G«-V; g\, ■ ■ • , i u k{ , . . . k{) - /(G^H, A, ■ ■ ■ flti); 

forms a solution to another polymatroid, from its property, we can conclude that, 

d{A j )> f{G^-gl---J j _ l J j ^ i )-f{G^-g\ r --J j _ l -A%i i ) 

where A c := Gf - A and A% = A c n q\. 
Therefore, for any A C Gf +1 , we have 

d{A) > f(G^;A,£ i+1 ) - /(G(*Vi+i)- (27) 

x 4+1 + d(A) 
= /(G«;G m )-/(GW)-d(A c ) 

< /(G«; G i+ i) - /(GW) - [/(G«;A c ,£ m ) - 

< /(GW;A,£ i+1 )-/(GW), 

where the first inequality follows from (27), applied to A c , and the second one follows from 
submodularity, i.e., 

f(G^;G i+1 ) - f(G^;A c ,£ i+1 ) < f(G^;A,£ i+1 ) - f(G^;£ i+1 ); 
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hence, the feasibility. 

□ 

Consider the 6-version of the performance polytope (in Definition 3) with side constraints: 

V'{b) = {x(A) > b(A),A CE; Xi < d u i e E}. (28) 

(For instance, di is the upper limit on the delay of type i jobs.) We want to solve the following 
LP problem: 

n 

min > axi. (29) 
v ' i=i 

Direct verification will establish that the Grouping Algorithm is readily adapted to solve the 
above problem: simply change / to b, and reverse the directions of all the inequalities involved. 
(Note that the side constraints are now Xj < di instead of X{ > di as in the /-version.) 

4.1 The Complexity of the Grouping Algorithm 

In general, the membership problem in Step 3 is as hard as the submodular function minimiza- 
tion itself. However, the special structure obtained through our construction of the subgroup 
will enable us to solve this problem much easily. The key is the following important observation, 

Lemma 7 In (26), only the side constraints and the constraint for the full set can be tight. 

Proof We prove this by induction. The case of two variables is obviously trivial. To illustrate 
the main idea, we first consider the case of three, then the general case. Now, suppose that 
subgroup g l j consists of two elements, k\ and k 2 - According to the Step 2 of the grouping 
algorithm, we have, 

f(G^;g\, • • • ,4_!) + d k . < f(G^;g\, ■ ■ ■ ,g)_ x , kj),j = 1,2 (30) 

f(G^- 1 \g\ r --,gi_ 1 ,k j ,k e )-f(G( i - 1 \g\,---,gi_ 1 ,k j )<d ke ,j,e=l,2. (31) 
Then, adding variable k% that satisfies, 

f(G^;g\, • • • , g)^) + d kz < f(G^;g\, • • • , g)_^ k 3 ), 

and 

f(G^;g\, ...,gi) + dk3 > f(G^;g\, ■■ -,g}, k 3 ). 
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First, we know that, 

f(G^;g\, ■ ■ ■ , k 3 ) + d kj < /(G^ 1 ); g[, ■ ■ • , g)_ v fc 3 , fc,),j = l,2 

can not be be valid for both j = 1,2, since in this case, £3 will form a subgroup of its own 
according to Step 2 of the grouping algorithm. From the fact that we have at least one j*, such 
that, 

f(G^;g\, • • • , fc 3 ) + d k .. < f(G^-g\, • • • , ^ fc 3 , ), 

it implies that x ks < 5(^3) can not be tight, for otherwise the solution is infeasible. Now 
consider the subsets of {k\,k 3 } and {k 2 ,ks}, they can not be tight, either. Otherwise, suppose 
that {^1,^3} is tight, then 

Xk 2 < g{h, k 2 , h) - g(ki, k 3 ) < g(k u k 2 ) - g{h) < d 2 

which leads to infeasibility of the problem. Here the first inequality is due to submodularity 
and the second one is just (31). Similar arguments apply to {^25^3}- 

For the general n case, we can repeat the similar argument. Suppose that we have a k n 
added into the subgroup gj. Then again, due to the fact that k — n is admitted into the 
subgroup, the following inequality will hold for some existing member of g^, j*, 

f(G^;gl---,d^k n ) + d k ^ <f(G^;g\,---,g}_ 1 ,k n ,k r ), 

which implies that x kn < g(k n ) can not be tight. For all the subset contains k n , but not the full 
set, the above submodularity argument can be employed again to show that the corresponding 
inequality can not tight. For those do not contain k n , the induction hypothesis tells us that 
they can not be tight. Hence, the proposition follows. □ 

Lemma 8 The complexity of Step 3 is 0{LP(n)). Where LP(n) denote the complexity of 
solving a linear program with no more than n constraints and n variables. 

Proof By complementary slackness, in the dual of (26), that is, 

max J2y A g(A) - ^d ki z k ^s.t. ^ y A - z kl < 1, ^ y A - z ki < 0,\/ki / fci, (32) 
A ki ABki A^ki 

the variables that corresponds to constraints other than the side constraints and the one for 
the full set equal to zero. Therefore, to solve the dual problem, we only need to solve a 
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linear program with i + 1 variables and £ constraints. That gives us the complexity bound of 
0{LP(n)). In fact, to check whether the original problem is feasible, we simply need to rule 
out the unboundness of the dual problem. □ 

To characterize the computational effort of the entire grouping algorithm, consider the point 
where we started Step 1 of the algorithm, i.e., when the n-th element is added to the m — 1 
groups that have already been formed for the problem of n — 1 elements. Note that throughout 
the algorithm, whenever an element moves, individually or in group, it always moves into a 
higher-priority (lower-indexed) group, or to a higher-priority within a group; and the very first 
subgroup is invalid, an entire group is eliminated. (Indeed, the number of groups can only 
decrease.) Hence, for an element originally in the fc-th group, the number of moves can be 
no greater than total number of subgroups ahead, with each move involving a fixed number 
of operations associated with checking the inequalities in (21), (22), (23), and (24)), and solving 
an LP. Therefore, to finalize the grouping with the additional n-th element, the computational 
effort involved is 0(n 2 x LP(n)). Since we (inductively) bring in the n elements one at the 
time, the overall effort of the grouping algorithm is 0(n 3 x LP{n)). 

4.2 Connections to the Queueing Scheduling Problem 

Return to the queueing scheduling problem considered in §2.2), with the side constraints. 

Theorem 9 Consider the control problem in (7), with additional side constraints, Xi > di, 
i G E, and suppose the Cj's follow (10). Solve the LP in (9). Let G\, . . . , G m be the groups 
formulated by applying the grouping algorithm. Then, the optimal control u* is a priority 
rule among the groups, with G\ having the highest priority, and G m the lowest priority; and 
within each group, u* is a randomized rule that enforces the side constraints (on the non-lead 
variables). 

For the last statement in the above theorem, reason as follows. With side constraints, the 
optimal solution x* to the LP in (9) returned by the grouping algorithm is an interior point 
of the original polymatroid V(f) (i.e., without the side constraints). Following Caratheodory's 
theorem (refer to, e.g., Chvatal [5]), x* can be expressed as a convex combination of no more 
than n vertices of V(f), each of which corresponds to a priority rule. (Note that the equality 
x{E) = f{E) has reduced the degree of freedom by one; hence, n, instead of n + 1.) In other 
words, x* can be realized by a control that is a randomization of at most n priority rules, with 
the coefficients of the convex combination used as the probabilities for the randomization. 
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In terms of implementing the optimal control, one can do better than randomization. It is 
known (e.g., Federgruen and Groenevelt [13]) that any interior point of the performance space 
can be realized by a particular dynamic scheduling policy, due originally to Kleinrock [24, 25], 
in which the priority index of each job present in the system grows proportionately to the time 
it has spent waiting in queue, and the server always serves the job that has the highest index. 
This scheduling policy is completely specified by the proportionate coefficients associated with 
the jobs classes, which, in turn, are easily determined by the performance vector (provided it 
is at the interior of the performance space). In terms of practical implementation, there are 
several versions of this scheduling policy, refer to [16, 17]. 

5 Submodular Function Minimization 

Suppose we want to minimize a submodular function ip(A), A C E. Following [7], we can 
transform this into an equivalent problem as follows. Define 

di:=iP(E-{i})-iP(E), ieE; 

and write d(A) = J^ieA^i- Let 

f(A):=^(A)+d(A)-^). (33) 

Then, minimizing 

f(A)-d(A)=^(A)-m 

is equivalent to minimizing the original ip(A). Clearly, /(0) = 0, and / is submodular. To see 
that / is also increasing, consider A C E, and i A. We have: 

f(A + {i})-f(A) = ^{A + {i}) + d(A)+d l -^{A)-d(A) 
= ^(A + {i})-i/;(A)+il;(E-{i})-^(E) 

> o, 

where the inequality follows from the submodularity of if). 

Hence, the minimization of a submodular function ip(A) over A C E is equivalent to the 
following problem: 

mm{f(A)-d(A)}. (34) 
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To solve this problem using the grouping algorithm of the last section, we still need (i) dj > 
for all j € E, and (ii) f(A) > d(A) for all A C E (refer to Lemma 2). 

Consider (i) first. If dj < 0, for some j, then, for any set A C E, due to the submodularity 
of tp, we have, for any negative dj, 

0>dj = ip(E - {j}) - 4(E) > 4(A - {j}) - 4(A), (35) 

implying — {j}) > That is, any minimizing set A will not contain j if dj < 0. Hence, 

we can redefine the set E by first excluding such fs. This way, we can effectively assume dj > 
for all j. 

The second requirement, f(A) > d(A), following (33), is equivalent to ip(A) > ^(0) for all 
A. To overcome this difficulty, we shall assume ip{A) is bounded from below: there exists an 
M > such that ?p(A) > — M for all ACE. Then, we can replace tp by ip' defined as follows: 

4'(A)=4(A), A^%; 4'($) = -M; 

and ip'{A) > ■0'(0) for all A C E. We then solve the minimization problem in (34), but over 
E — (instead of E). Once the minimizing set A* is identified, we can compare 4(A*) with 
-0(0) to determine whether A* or is optimal. 

Theorem 10 Let G\ be the first group formulated from applying the grouping algorithm 
(under the permutation ir = (1, ...,n)). (Recall, x\ is the lead variable in G±.) Then, for any 
B ^ G\, with 1 £ 5 C £, B is inferior to G\ in terms of yielding a larger objective value in 
(34). 

Proof. First, suppose B C G\. From Lemma 4, we have 

f(B)>x(B), f(G 1 ) = x(A), 

where x denotes the variable of the LP over V(f) as in the last section. Hence, 

/(d) - f(B) < x(G ± ) - x(B) = d(Gi) - d(B), 

where the equality follows from the fact that IgBcGi; hence, G\ — B only contains non-lead 
variables. 

Next, suppose B D G\. Write B = G\ + A, where A C E — G\. We have, following Lemma 

?? 
• • ? 

f(B) = f(A + Gi) > x(A) + x(G 1 ) = x(A) + f(G l ). 
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Hence, 

f{B) - /(Gi) > x(A) > d(A) = d(B) - did). 
Now, consider a general B. From submodularity, we have 



/(Gi) + /(B) > /(Gx U 5) + /(Gi n B). 



Hence, 



f(B)-d(B) 

> f(G 1 UB) + f(G 1 DB)-f(G 1 )-d(B) 

= fid U B) - did UB) + fid n B) - did n B) - fid) + did) 

> 2[fid)-did)]-fid)+did) 

= fid) -did), 

where the first equality follows from d(-) being additive, and the second inequality follows from 
the two cases already established above (regarding the subsets and supersets of G\). □ 
Following Theorem 10, we can conclude that after the set A = G\ is identified we do not 
have to consider the element 1 any more: any set that includes it must yield a larger objective 
value in (34) than G\. To obtain the solution to problem(34), let us add another variable {0} 
into the problem, define do = —1 and f(A U {0}) = fiA), this again defined a submodular 
function on E = E U {0}. In the running of the algorithm, give the highest priority, then it 
becomes the lead variable in Gi, hence the algorithm will produce, 

min {f(A)-d(A)}. 

ACE-{0} 

However, from the definition, we know that will never belongs to the minimum, hence, this 
is the optimum of (34). 

Remark 1 Another way to view Theorem 10 is to consider the minimization problem in (34), 
with the restriction that A 3 1, i.e., 

^{/(AJ-dCA-^})}. (36) 

Note that we have dropped the —d\ term in (36). The problem in (36) can be related to the 
LP over P'(f), with the following cost coefficients: 

ci = 1, c 2 = • • • = c n = 0. 
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Then, (36) becomes the same as the dual LP problem in (16), with ya 1 = 1, and i/a = for 
all other A = G^>\ j = 2, ...,m; = 1, kj € Gf, and all other z« = 0. Hence, the grouping 
algorithm will return A = G\ as the optimal solution to (36). 

This also suggests an alternative approach to the SFM in (34): for i = l,...,n, solve 
mmi £ ACE{f(A) — d{A — {i})}, by applying the grouping algorithm to the LP over V'(f), with 
Cj = 1 and Cj = for j ^ i. Each results in a subset G\ 3 i. Clearly, only these n subsets need 
to be considered; hence, we can compare these in terms of their corresponding objective values 
in (34), and pick the the smallest one. □ 

Remark 2 The starting point of virtually all existing SFM algorithms in the literature (in 
particular, those overviewed in the introductory section) is to relate the problem in (34) to the 
following equivalent LP: 

max{:r(£) : x € V(f), x { < d i: i G E}, (37) 

where V(f) is the polymatroid in (1). (The equivalence between (34) and (37) is established 
in [10]; also refer to [27].) The above LP is then solved via finding augmenting paths in an 
associated graph. In this approach, the key step is to make sure that any improvement on the 
current solution y remains within the (base) polymatroid £>(/), and this is accomplished via 
Caratheodory's representation of x (as a convex combination of the vertices of the polymatroid). 
As commented by McCormick in [27], "this is a rather brute-force way to verify that y £ £>(/). 
However, 30 years of research have not yet produced a better idea." 

In contrast, as explained in Remark 1, our approach is to solve the SFM in (34) via n 
subproblems such as the one in (36), which corresponds to the dual of an LP over V'(f ). Note 
that V'(f) is different from the polytope in (37) — the inequalities in the side constraints are 
in the opposite direction. This subtle distinction turns out to be crucial: While the grouping 
algorithm solves the LP over V'(f), it is not clear how the algorithm can be adapted to solve the 
LP in (37). Recall, the intuition behind the grouping algorithm is that each job class associated 
with a non-lead variable, a non-contributor to maximizing the objective function, is moved into a 
higher-priority group just to meet the lower limit on its performance (throughput) requirement. 
Should the side constraints become upper limits, the structure of the solution would have to be 
completely different. □ 
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